A corpus of post-edited translations (Un corpus d'erreurs de traduction) [in French]

Authors

  • Guillaume Wisniewski
  • Anil Kumar Singh
  • Natalia Segal
  • François Yvon
Abstract

More and more datasets of post-edited translations are being collected. These corpora have many applications, such as failure analysis of SMT systems and the development of quality estimation systems for SMT. This work presents a large corpus of post-edited translations gathered during the ANR/TRACE project. Applications to the detection of frequent errors and to the analysis of the inter-rater agreement of hTER are also reported.

Keywords: machine translation, error analysis, post-editing.

Similar resources

Un système modulaire d'acquisition automatique de traductions à partir du Web

We present a method for the automatic translation (French/English) of Complex Lexical Units (CLU), with the aim of extracting a bilingual lexicon. Our modular system is based on linguistic properties (compositionality, polysemy, etc.). Different aspects of the multilingual Web are used to validate candidate translations and collect new terms. We first build a French corpus of Web pages to collect CLU. ...

Identification, Alignment, and Translation of Relational Adjectives from Comparable Corpora (Identification, alignement, et traductions des adjectifs relationnels en corpus comparables) [in French]

Abstract. In this article, we extract French relational adjectives and automatically align them with the nouns from which they are derived, using a monolingual corpus. The adjective-noun alignments are then used in the compositional translation of complex terms of the form [N AdjR] from a French-English comparable corpus. A new term [N N'] (e.g. canc...

Some translation problems of relational adjectives from French to Wolof: a corpus based study of business terminology (De quelques problèmes de traduction des adjectifs relationnels du français vers le wolof : étude sur corpus de terminologie commerciale) [in French]

The French language, particularly French for Specific Purposes, uses many relational adjectives, which indicate a certain degree of specialization. In a corpus analysis of 90420 occurrences gathered from dissertations and newspapers related to the field of trade, we tried to build a collocation corpus of nouns and adjectives, particularly relational adjectives. A great interest in the proces...

Translating Chinese Romanized Name into Chinese Idiographic Characters via Corpus and Web Validation

Cross-language information retrieval performance depends on the quality of the translation resources used to pass from a user’s source language query to target language documents. Translation lists of proper names are rare but vital resources for cross-language retrieval between languages using different character sets. Named entities translation dictionaries can be extracted from bilingual cor...

(Much) Faster Construction of SMT Phrase Tables from Large-scale Parallel Corpora (Construction (très) rapide de tables de traduction à partir de grands bi-textes) [in French]

Abstract. In this demonstration paper, we introduce software that builds translation tables much faster than state-of-the-art techniques. This notable speed-up is obtained through a double sampling: one sampling selects a limited number of sentence pairs containing the segments to be translated, the other performs a...

Publication date: 2013